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Abstract — It is shown that several classical open problems 
in linear system theory, such as computation of stabilizing 
static output feedback gains and optimization of decentralized 
controllers, can be readily solved for positive systems using linear 
programming. In particular, optimal solutions can be verified for 
large-scale systems using computations that scale linearly with 
the number of interconnections. The restriction to positive state 
space models is natural in many applications, including stochastic 
systems, transportation networks and economics. Moreover, the 
main results are generalized to frequency domain representations 
using the notion of positively dominated system. 



I. Introduction 

Classical methods for multi-variable control, such as LQG 
and i/oo -optimization, suffer from a lack of scalability that 
make them hard to use for large-scale systems. The difficulties 
are partly due to computational complexity, but also absence 
of distributed structure in the resulting controllers. Complexity 
growth can be traced back to the fact that stability verification 
of a linear system with n states generally requires a Lyapunov 
function involving n^ quadratic terms, even if the system 
matrices are sparse. In this paper we will see that the situation 
improves drastically if we restrict attention to closed loop 
dynamics described by system matrices with nonnegative 
off-diagonal entries. Then stability and performance can be 
verified using a Lyapunov function with only n linear terms. 
Sparsity can be exploited in performance verification and 
even synthesis of distributed controllers can be done with a 
complexity that grows linearly with the number of nonzero 
entries in the system matrices. These observations have far- 
reaching implications for control engineering: 

1) The conditions that enable scalable solutions hold nat- 
urally in many important application areas, such as 
stochastic systems, economics, transportation networks, 
chemical reactions, power systems and ecology. 

2) The essential mathematical property can be extended 
to frequency domain models . A sufficient condition 
is that the transfer functions involved are "positively 
dominated". 

3) In control applications, the assumption of positive dom- 
inance need not hold for the open loop process. Instead, 
a large-scale control system can often be structured into 
local control loops that give positive dominance, thus 
enabling scalable methods for optimization of the global 
performance. 

The paper is structured as follows: Sections ffnillll introduce 
background literature and notation. Stability criteria for posi- 
tive systems are cited in section |IV] These results are not new, 
but stated on a form convenient for later use and explained 
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with emphasis on scalability. Section |V] extends the stability 
results to input-output performance. The analysis results are 
then exploited in section [Vl] for synthesis of stabilizing and 
optimal controllers using distributed linear programming . Sec- 
tion [Vll] extends the techniques to positively dominated trans- 
fer functions. Section lVIIII explains how Lyapunov inequalities 
for positive systems can be verified using methods that scale 
linearly with the number of states and interconnections. Simi- 
lar methods are used in section|lX]to prove a general version of 
the Kalman-Yakubovich-Popov Lemma for positive systems. 

II. Background 

The study of matrices with nonnegative coefficients has a 
long history, dating back to the Perron-Frobenius Theorem 
in 1912. A classic book on the topic is |3|. The theory is 
used in Leontief economics ifTTl . where the states denote 
nonnegative quantities of commodities. Systems defined by 
nonnegative matrices (so called positive systems) appear in 
the study of Markov chains ll26l . where the states denote 
nonnegative probabilities and in compartment models flT], 
where the states could denote populations of species. A nice 
introduction to the subject is given in lITSl . 

A fundamental property of linear maps described by a 
positive matrix is that they are contractive in Hilbert's pro- 
jective metric ||4|, llT4l . This metric is closely related to the 
Lyapunov function max{a;i, . . . , a;„}— min{xi, . . . , a;„}, used 
for example in analysis of consensus algorithms ll26l . |J29|. For 
more recent contributions, see JST], ll27l . 

A nonlinear counterpart to positive systems is monotone 
systems, characterized by the property that a partial ordering 
of initial states is preserved by the dynamics. Such dynamical 
systems were studied in a series of papers by Hirsch [8|, 
|9|, showing that monotonicity generally implies convergence 
almost everywhere. Positive systems have also gained increas- 
ing attention in the control literature during the last decade. 
See for example ll30l . Q, (|T2|. Feedback stabilization of 
positive linear systems was studied in lfT6l . Il23l . ||22 I and basic 
control theory for nonlinear monotone systems was developed 
in |1 ). A recent result by Tanaka and Langbort [28] shows that 
decentralized controllers can be optimized for positive systems 
using semi-definite programming. The criterion is the closed 
loop Hoa norm and the authors show that diagonal quadratic 
storage functions can be used without conservatism. Some of 
the main results in this paper (first presented in [124] ). can be 
viewed as extensions of that work. 

III. Notation 

Let R+ denote the set of nonnegative real numbers. The 
inequality X > (X > 0) means that all elements of the ma- 
trix (or vector) X are positive (nonnegative). For a symmetric 





Fig. 1. Level curves of Lyapunov functions corresponding 

to the conditions {I]2), (T]3) and (T]4) in Proposition [T] 
If Ai; < 0, then V{x) = maxi(a;i/^i) is a Lyapunov function with 



rectangular level curves. li z A < 0, then V{x) = z x is a linear 
Lyapunov function. Finally if A^ P + PA -< and P >- 
V(x) = x^ Px is a quadratic Lyapunov function for the system x - 



0, then 
= Ax. 




Fig. 2. A graph of interconnected systems. In Example 1 the interpretation 
is a transportation network and each arrow indicates a transportation link. In 
Example 2 the interpretation is instead a vehicle formation and each arrow 
indicates the use of a distance measurement. 



matrix X, the inequality X )^ means that the matrix is 
positive definite. The matrix A S R"^" is said to be Hurwitz 
if all eigenvalues have negative real part. It is Schur if all 
eigenvalues are strictly inside the unit circle. Finally, the 
matrix is said to be Metzler if all off-diagonal elements are 
nonnegative. The notation H^^™ represents the set of n x m 
matrices whose entries are analytic functions in the right half 
of the complex plane. 



IV. Distributed Stability Verification 
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Proposition 1: Given a Metzler matrix A <E W 
following statements are equivalent: 
(1.1) The matrix A is Hurwitz. 

There exists a ^ e M" such that ^ > and A^ < 0. 

There exists a z € K." such that z > and z^ A < 0. 

There exists a diagonal matrix P ;^ such that 

A'^P + PA^O. 

The matrix —A^^ exists and has nonnegative entries. 
Moreover, if ^ = (^i, . . . , ^„) and z = {zi, . . . , Zn) satisfy 
the conditions of (1.2) and (1.3) respectively, then P — 
diag(zi/^i, . . . , Zn/^n) satisfies the conditions of (1.4). 
Remark 1. Each of the conditions ([I]2), ([T]3) and ([T]4) 
corresponds to a Lyapunov function of a specific form. See 
Figure [T] 

Remark 2. One of the main observations of this paper is that 
verification and synthesis of positive control systems can be 
done with methods that scale linearly with the number of 
interconnections. For stability, this claim follows directly from 
Proposition [T] Given ^, verification of the inequality A^ < 
requires a number of scalar additions and multiplications that 
is directly proportional to the number of nonzero elements in 
the matrix A. In fact, the search for a feasible ^ also scales 
linearly, since integration of the differential equation ^ = A^ 
with ^(0) = ^0 for an arbitrary ^o > generates a feasible 
^(t) in finite time provided that A is Metzler and Hurwitz. 

Proof of PropositionUl The equivalence between (1.1), (1.2), 
(1.4) and (1.5) is the equivalence between the statements G20, 
l27, H24 and iVas in fs! Theorem 6.2.3]. The equivalence be- 
tween (dll) and ([T]3) is obtained by applying the equivalence 
between ([T]l) and ([T]2) to the transpose of A. Moreover, if 
C = ("^ij . . ■ I (,n) and z = (21, . . . , Zn) satisfy the conditions of 
([T|2) and ^3) respectively, then P = diag(zi/^i, . . . , Zn/S,n) 
gives (A^P + PA)S, = A^z + PA^ < so the symmetric 
matrix A^ P + PA is Hurwitz and ([T]4) follows. D 



Example 1. Linear transportation network. Consider a 
dynamical system interconnected according to the graph il- 
lustrated in Figure |2] 
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(1) 

The model could for example be used to describe an 
transportation network connecting four buffers. The states 
2^1, 2^2, 2^3, 2:4 represent the contents of the buffers and the 
parameter iij determines the rate of transfer from buffer j 
to buffer i. Without such transfer the content of the second 
and third buffer would grow exponentially due to the diagonal 
elements 2 and 3, corresponding to unstable internal dynamics 

of those buffers. 

Notice that the dynamics can be written as i = Ax where 
A is a Metzler matrix provided that every iij is nonnegative. 
Hence, by Proposition [l] stability is equivalent to existence of 
numbers ^1 , . . . , ^4 > such that 
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Given these numbers, stability can be verified by a distributed 
test where the first buffer verifies the first inequality, the second 
buffer verifies the second and so on. In particular, the relevant 
test for each buffer only involves parameter values at the local 
node and the neighboring nodes, so a global model is not 
needed anywhere. D 

Example 2. Vehicle formation (or distributed Kalman 
filter). Another system structure, which can be viewed as a 
dual of the previous one, is the following: 

'xi = -xi +£i3(a;3 - xi) 

X2 = i2l{xi ~ X2) + i23{x3 ~ X2) 

is = ^32(2^2 - X3) + Iziixi - 2:3) 

^±4 = -4X4 + 43(2:3 - 2:4) 



(2) 



This model could for example be used to describe a platoon 
of four vehicles. The parameters iij represent position adjust- 
ments based on distance measurements between the vehicles. 
The terms —xi and —4x4 reflect that the first and fourth 
vehicle are equipped to maintain stable positions on their 
own, but the second and third vehicle rely on the distance 
measurements for stabilization. Again, stability can be verified 
by a distributed test where the first vehicle verifies the first 



inequality, the second vehicle verifies the second inequality 
and so on. D 

A discrete time counterpart to Proposition [T] can be stated 
as follows: 

Proposition 2: For B E M"^", the following statements are 
equivalent: 

(2.1) The matrix B is Schur stable. 

(2.2) There is a ^ e R" such that ^ > and Bf < ^ 

(2.3) There exists a z g R" such that z > and B^z < z. 

(2.4) There is a diagonal P ^ such that B^PB -< P. 

(2.5) (/ — B)^^ exists and has nonnegative entries. 
Moreover, if ^ — (^i, . . . , ^„) and z — {zi, . . . , z„) satisfy 
the conditions of (2.2) and (2.3) respectively, then P — 
diag(zi/^i, . . . , Zn/£,n) satisfies the conditions of (2.4). 
Proof. The equivalence between (|2]l) and ^5) is proved by 
13] Lemma 6.2.1]. Setting A — B — I gives the equivalence 
between (|2]2), (|2l3) and (|2]5) from the equivalence between 
([I]2), ^3) and ^5). 

Suppose ^ = (Ci, ■•-,?«) and z = (zi,...,z„) satisfy 
the conditions of ^2) and ^3) respectively. Set P = 
diag(zi/,^i, . . . , Zn/^n) and yk = V^kZk for /c = 1, . . . , n. 
Then 

p-U^B'^PBP-^^^y = p-^l'^B'^PBi < p-^/'^B^P^ 
- p-i/^B^z < p-i/2^ = y 

so p-^/^B^PBP-^/^ is Schur, p-^'^B^ PBP-^'^ < I 
and (l2]4) follows. Finally, (l2]4) shows that x^ Px is a positive 
definite Lyapunov function for the system x+ — Bx, so (|2]l) 
follows from ^4). D 

V. Input-Output Performance of Positive Systems 

We will now move beyond stability and discuss input-output 
performance. A connection between stability and performance 
is established by the following theorem: 

Theorem 3: Suppose that G(s) = C{sI~Ay^B + D with 
A e M"''" Metzler and B e Wl''\ C G R^""', D e R+. 
Define ||G||oo = sup^ |G'(iLL')|. Then the following statements 
are equivalent: 

(3.1) The matrix A is Hurwitz and ||G||oo < 7- 
'A B 
C D--f 



(3.2) The matrix 



is Hurwitz. 



(3.3) There exists a diagonal P ;^ such that all solutions 
to i; = Ax + Bw satisfy 

d 



dt 



x{tY Pxit) + \Cx{t) + Dw{t)Y < 7^1^(01' 



(3) 



with strict inequality when {x{t),w{t)) ^ (0,0). 

(3.4) There exists p E R" such that p > and all solutions 
to i = Ax + Bw satisfy 

^ (p^|x(i)|) + \Cxit) + Dw{t)\ < ^\w{t)\ (4) 
with strict inequality when {x{t),w{t)) ^ (0,0). 



Remark 3. The equivalence between (l3]l) and (l3]3) was earlier 
proved in (281. 

Remark 4. An consequence of Theorem |3] is that the four 
equivalent conditions not only guarantee that 7 is a gain bound 
on G(s), but on every operator of the form C{I—AA)~^AB+ 
D where A(a;) — (Ai(xi), . . . , A„(x„)) and every A^ is a 
passive scalar operator In fact, (13) can be written as 

2x{tfP{Ax{t) + Bw{t)) + \Cx{t) + Dw{t)\'^ < 7^|w(i)P 

Integrating the inequality over t E [0, r] and using the passivity 
conditions L Xk{Axk + Bwk)dt > gives the desired gain 

bound J^ \Cx{t)+Dw{t)\^dt < j^ J^ \w{t)\'^dt. 

Proof of Theorem \3\ First note that the maximum 
max^ |G(iw)| must be attained at w = since 



\GiiLo) 



D 



'Ce^'B 



dt 



<D+ Ce'^^Bdt = D- CA-^B = G(0) 

Hence ||G||oo < 7 may equivalently be written 

D-CA^^B <7 

Suppose that ^l) holds. Then D - CA-^B < 7. By 
Proposition [T] there exists ^ > such that A^ < 0. Define 
a: = ^ — A^^B. Then x > since —A^^ > 0. Moreover 



Ax + B = AS, <0 
If ^ is sufficiently small, we also get Cx 
B 



D < 7 so 



A 
C 



< 



(5) 



and (l3]2) holds. Conversely, (l3]2) implies (|5]l for some x > 0, 
so A is Hurwitz and 



-A^^B < X 



D 



CA-^B < Cx 



P» <7 



Hence (l3]l) follows and equivalence between (l3]l) and (l3]2) 
has been established. 

Next assume that (l3]3) holds. For w ~ 0, the inequality (O 
implies ■^xit)'^ Px{t) < with strict inequality for x{t) =^ 0, 
so A must be Hurwitz. Integrating Q along solutions to i = 
Ax + Bw with a:;(0) = gives 



\Cx{t) + Dw{t)\^dt<-f^ 



\wit)\'^dt 



so |lG||oo < 7 and (l3]l) follows. The opposite implication, 
from (l3]l) to (l3]3), is known from [28] and is also a con- 
sequence of Theorem [141 to be proved in the last section of 
this paper: Condition (l3]l) implies (fl4ll) of Theorem [l4l with 
strict inequality for 

„ _ \C^C C^D ' 
^~ [DC D^-j^ 

Hence (fT4l3) of Theorem [T4l implies existence of a diagonal 
P ^ such that 



2x^P{Ax + Bw) + \Cx + Dw\^ < "f^lwl"^ 



with strict inequality for {x,w) 7^ (0,0). Condition ([3]3) 
follows and the equivalence between (l3]l) and (|3]3) is es- 
tablished. 

Assuming that (l3]4) holds, consider x{t) > 0, w{t) > and 
rewrite dD as 



p^{Ax + Bw) +Cx + Dw < jw 



Hence 



<0 



'A B 

C D-TJ 

and (l3]2) follows by Proposition [T] 

Finally, assuming (l3]2), we get (|6]l for some p > 0, so 

d 



(6) 



dt 



< 



{p'^\xit)\) + \Cxit) + Dwit)\ 



dx , , 



C\x{i)\+D\w{t)\ 



< p^ {A\x{t)\ + B\w{t)\) + C\x{t)\ + D\w{t)\ < 

with strict inequaUty when {x(t),w{t)) 7^ (0,0). This proves 
([3]4) and the proof is complete. D 

A discrete time version can be stated as follows: 
Theorem 4: Let G(z) = C{zl - A)-^B + D where 



A e 



\ B e 



,Ce 



and D e 



Define 



||G||oo =sup„|G(e^' 
equivalent: 

(4.1) The matrix A is Schur and ||G 



Then the following conditions are 



(4.2) The matrix 



00 < 7- 
is Schur. 



A B 

7-IC "f-^D 

(4.3) There exists a diagonal matrix P )^ such that 

\x{t + l)|p + \Cx{t) + Dw{t)f < \x{t)\l + 7^(^)1^ 
for all solutions to x{t + 1) = Ax{t) + Bw{t). 

(4.4) There exists a p E R" such that p > and 

p'^\x{t + 1)1 + \Cx{t) + Dw{t)\ < p^\x{t)\ + jlwit)] 
for all solutions to x{t + 1) = Ax{t) + Bw{t). 

A proof of Theorem|4]is given in the appendix. We conclude 
this section by pointing out that for scalar positive systems, 
all induced norms are equal: 

Theorem 5: For a scalar impulse response g{t) and w £ 
Lp[0, cx)), let g*w denote the convolution of 17 and w. Suppose 
that g{t) > and /^ g{t)dt < 00. Then the induced norm 
||g||p_i„d = sup„ Jl|^ satisfies 

poo 

||g||p-ind = / g{t)dt p€ [l,cx)] 

Jo 

Proof. It is well known that ||(7||2-ind = max^ |G(e'")| where 
G(s) = Jp g{t)e^^*^dt. When g{t) > 0, the maximum must 
be attained at w = since 



|G(iw)| 



9it)e- 



*dt 



< I g{t)dt ^GiO) 
'0 



Moreover 

\\ymi = 







g(t — T)w{T)dT 







dt 



nOO nt 

< / g{t - T)\w{T)\dTdt 

Jo Jo 

/•oo 



g{t-T)dt]\w{T)\dT 



g{t)dt] \\w 



with equality when w{t) > for all t. Similarly 



\yit)\ 



g{T)w{t — T)dT 



< I g{T)\w{t ~ r)|dr 
'0 

< 



g{T)dT \\w 



with equality if w is constant. Hence the desired equality 



bllp-ind = / 9{t)dt 
Jo 



(7) 



has been proved for p = 1, p — 2 and p — 00. The Riesz- 
Thorin convexity theorem flO, Theorem 7.1.12] shows that 
llffllp-ind is a convex function of p for 1 < p < 00, so (|7]) 
must hold for all p G [1, 00]. □ 



VI. Distributed Control Synthesis by Linear 
Programming 

Equipped with scalable analysis methods for stability and 
performance, we are now ready to consider synthesis of 
controllers by distributed optimization. We will start by re- 
visiting an example of section |IV] 

Example 3 Consider again the transportation network ([!}, this 
time with the flow parameters £31 = 2, £34 — 1 and £43 — 2 
fixed: 



Xi 




±2 




±3 




X4_ 





-3 £12 

2 - £12 - 

2 42 





1^32 





^3 

1-^3 1 

2 -5 



(8) 



We will ask the question how to find the remaining parameters 
^12, ^3 and £32 in the interval [0, 10] such that the closed 
loop system ([8]) becomes stable. According to Proposition [1] 
stability is equivalent to existence of ^1 , . . . , ^4 > such that 



-3 42 

2-42-^2 ^3 

2 ^2 1-^3 1 

2-5 



<0 



At first sight, this looks like a difficult problem due to multi- 
plications between the two categories of parameters. However, 
a closer look suggests the introduction of new variables: 
M12 := ^126. M32 := ^326 and /i23 := ^23C3- The problem 



then reduces to linear programming: Find (,1,^2, £,3, £,4 > 
and Aii2,A'32,/^23 > such that 
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<0 



M12 < 106 /^32 < 106 Ai23 < 106 

with the solution (6,6,6,6) = (43,12.8,10.1,4.2) and 
(^12,^32,^23) = (128,0,101). The corresponding stabilizing 
gains can then be computed as 

^12 = A'12/6 = 10 ^32 = /^32/6 = ^23 = M23/6 = 10 

D 

The idea can be generalized into the following theorem: 
Theorem 6: Let the matrices A € M"^", £; G M"""", F e 
R™^", K e R™x™ be given and let V be the set of m x m 
diagonal matrices with entries in [0, 1]. Suppose that (/ — 
LK)-'^ exists and A + E{I - LK)-^LF is Metzler for all 
L E v. If F and K have nonnegative coefficients, then the 
following two conditions are equivalent: 

(6.1) There exists LeV such that A + E{I - LK)-^LF 
is Hurwitz. 

(6.2) There exist ^ e M!^, /i e WJ^" with fJ. < F£ + Kfi 
and A£, + Efj, < 0. 

Alternatively, if E and K have nonnegative coefficients, then 
(6.1) is equivalent to 

(6.3) There exist p £ M!^, q e W^ with q < E'^p + K'^q 
and A^p + F'^q < 0. 

Remark 5. If the diagonal elements of V are restricted to IR+ 
instead of [0,1], then the condition n < F£ + Kfi is replaced 
by < F^ + Kfi. 

Remark 6. Each row of the vector inequalities can be verified 
separately to get a distributed test. 

Remark 7. It is natural to compare the expression A + E{I — 
LK)~^LF with the "state feedback" expression A + BL of 
standard linear quadratic optimal control. A major difference 
is the presence of F and K which make the optimization 
into a problem of "static output feedback" rather than state 
feedback. Another difference is the diagonally structured L 
instead of a full matrix. The diagonal structure gives a much 
higher degree of flexibility, particularly in the specification of 
distributed controllers. 

Proof of Theorem |6] Suppose (|6]l) holds. Let A + E{I — 
LK)-^LF be Hurwitz and define £,&Wl with [A + E{I - 
LK)-^LF]£ < 0. Let /i = (/ - LK)-^LF£. Then /i = 
L{F£ + A» and A£ + Eii={A + E{I - LK)-^LF)£ < 0. 
Conversely, suppose that (|6]2) holds. Choose L E T> to get 
/!= {I -LK)-^LF£. Then 

[A + E{I - LK)-^LF]£ = A^ + Efi<0 

so A + E{I — LK)^^LF is Hurwitz. The equivalence between 
dUl) and (|6]3) follows immediately by replacing A + E{I — 
LK)'^LF with its transpose. D 



Combining Theorem [3] with Theorem |6] also gives a linear 
programming formulation of the problem to minimize input- 
output gain: 

Corollary 7: Let T) be the set of m x m diagonal matrices 
with entries in [0, 1]. Suppose that D is scalar and that A + 
ELF is Metzler for all L e X>. 

If the matrices B,C,D and F have nonnegative coefficients, 
then the following two conditions are equivalent: 

(7.1) There exists L eV such that A + ELF is Hurwitz 
and 

\\C[sI-{A + ELF)]-^B + D\\^<-1. (9) 

(7.2) There exist ^ G K'^, /x e W^ with 

Ae + F/i + B<0 C£ + D <-^ 1^<F£ 
If 6/^ satisfy (7.2), then (7.1) holds for every L such that 

^I = LFC 

Alternatively, if B,C,D and E are nonnegative, then (7.1) 
is equivalent to 

(7.3) There exist peR^, qe K™ with 

A^p + F^q + C^ < B'^p + Z? < 7 q< E'^p 

If p,q satisfy (7.3), then (7.1) holds for every L such that 

q = LE^p. 

Remark 8. It is interesting to compare Corollary |7] with 
the corresponding synthesis method proposed in ||281 . which 
exploits ([T]4) of Proposition [T] as criterion for stability rather 
than ([r|2) and ([T]3). An advantage of the approach in 1.2 8 J 
is that the Metzler property of the closed loop system matrix 
can be enforced in the synthesis procedure as a constraint, 
rather than being verified a priori for all L E V. On the other 
hand, the linear programming approach proposed here has a 
simpler structure, where the distributed and scalable nature 
of the conditions is apparent. Tanaka and Langbort do not 
discuss the scalability aspects of their approach in ll28l . but a 
scalable decomposition of their conditions will be developed 
in Theorem [12] of this paper 

Proof. According to Theorem |3] condition (Hi) holds if and 
only if there exists £ G K" with 



A 



-ELF 
C 



<0 



(10) 



B 

Given ([Toll, the inequaUties of ^2) hold with ^ = LF£. Con- 
versely, given (l7]2), the inequalities of (fTOl i follow provided 
that /i = LF£. This proves the desired equivalence between 
dZll) and (02). The equivalence between Ql) and (|7]3) 
follows immediately by replacing G{s) with its transpose. D 

Example 4. Disturbance rejection in vehicle formation. 

Consider the vehicle formation model 

-xi +^13(2:3 - Xi) +Wi 

l2l{xi -X2) +43(2^3 -X2) +W2 
42(2:2 - 2:3) + ^4(2:4 - 2:3) + W3 

-4x4 + ^3(2:3 — 2:4) + wa 



(11) 



where wi, . . . , W4 are external disturbances acting on the ve- 
hicles. Our problem is to find feedback gains gains lij G [0, 1] 



that stabilize the formation and minimize the gain from w to 
X. The problem can be solved by applying linear programming 
to condition (|2l2) with 



A = diag{-l,0,0,-4} 
C= (1 1 1 1 ) 



£' = 



K = 



E = 



/ 1 \ 

110 

110 

\ 1 / 



B = 



1 
1 



L = diag{£i3 , £21 , hs , ^32 , ^34 , hs } 





(-1 


1 


\ 






1 -1 










F = 


-1 
1 


1 
-1 













-1 


1 






i^ 


1 


-^ J 




The optimal L = diag{0, 


1,1,0,1,0} gives 7 = 4.125. 


If we instead put B 


= ( 


10 10 1 1 )^, the minimal 


value 7 = 


15.562 is 


attained wit 


1 L = diagjl, 1,1, 0,1,0}. 



Conversely, B — (^ 1 1 10 10 ) gives the minimal 
value 7 = 12.750 for L = diagjO, 1, 0, 1, 1, 0}. D 

A discrete time counterpart to Theorem |6] is given without 
proof: 

Theorem 8: Let T) be the set of m x m diagonal matrices 
with entries in [0,1]. Given A e M"^",S e M"^™,F e 
W^'^",K e R"^™, suppose that (/ - LK)-^ exists and 
A + E{I - LK)-^LF is non-negative for all L e V. If F 
and K have nonnegative coefficients, then the following are 
equivalent: 

(8.1) A + E{I - LK)-^LF is Schur for some LeV. 

(8.2) There exist ^ e Kf:, ^ e M!^ with n < FC + K^ 
and ^^ + ^^ < ^. 

Alternatively, if E and K have nonnegative coefficients, then 
(8.1) is equivalent to 



(8.3) There exist p e R!;:, q e 
and A^p + F'^q < p. 



I™ with q < E'^p + K^q 



VII. Positively Dominated Systems 

So far, the emphasis has been on state space models. 
However, for many applications input-output models are more 
natural as a starting point. In this section, we will therefore 
extend the main ideas of the previous sections to such models. 
First we need to define a notion of positivity for input-output 
models. One option would be to work with non-negative 
impulse responses like in Theorem |5] However, to verify for 
a given rational transfer function that the impulse response is 
non-negative has proved to be NP-hard! See [5 1 for the discrete 
time problem and [2] for continuous time. Instead we will use 
the following definition. 

A stable rational transfer matrix G is called positively 
dominated if every matrix entry satisfies |Gj7j(iaj)| < Gjj.(0) 
for all w e M. (More generally the term is used for G e H™^" 



if ||Gjfc||oo = liminfg_j.o Gjk{<^)). The set of all such matrices 
is denoted BH™^". The essential frequency inequality can be 
tested by semi-definite programming, since 



b{iuj) 



a{iuj) 



< 



m 

a(0) 



for uj £ 



if and only if the polynomial |a(i(u;)p6(0)^ — |6(ia;)pa(0)^ is 
a sum of squares. 

Some properties of positively dominated transfer functions 
follow immediately: 

Proposition 9: Let G,H e nW^'\ Then GH e DH^''" 
and aG + 6H e DH^""" when a, 6 G R+. Moreover ||G||oo = 
l|G(0)||. 

The following property is also fundamental: 

Theorem 10: Let G G DH;^''". Then (/- G)-i G nWt^'^ 
if and only if G(0) is Schur 

Proof. That (/ — G)^^ is stable and positively dominated 
implies that [/ — G(0)]~^ exists and is nonnegative, so G(0) 
must be Schur according to Proposition |2] On the other hand, 
if G(0) is Schur we may choose ^ G R+ and e > with 
G(0)^ < (1 - ()i- Then for every z G C" with < \z\ < ^ 
and s G C with Re s > we have 

|G(s)*2| <G(0)*|z| < (l-e)*|z| fori =1,2,3,... 

Hence X]fe!Lo G(^)*'^ ^^ convergent and bounded above by 



EZoGioY 



[I — G(0)] ^\z\. The sum of the series 



solves the equation [/ — G(s)] J2T=o G(s)*2: = z, so therefore 
Efelo G(s)*z = [/ - G(s)]-iz. This proves (/ - G)-^ is 
stable and positively dominated and the proof is complete. D 

Corollary |7] can now be generalized as follows: 
Theorem 11: Let T) be the set of mx m diagonal matrices 
with entries in [0, 1]. Suppose that B G DW^\ C G DH^'^", 
D G DHoo and A + ELF G DH'^""" for all LeV. 

If F G BH™^", then the following two conditions are 
equivalent: 

(11.1) There is L e V with (/ - A - ELF)-^ G BH^""" 
and ||C(/ - A - ELFj-^B + D||oo < 7. 

(11.2) There exist ^ e R+, M e R^ with 

A(0)^ + E(0)m + B(0) <£, 

C(0)e + D(0)<7 

F(0)e > /i 

If ^,/i satisfy (11.2), then (11.1) holds for every L such that 
M = LFm. 

Alternatively, if E G BW^"', then (11.1) is equivalent to 

(11.3) There exist peM.^, qeW^ with 

A{Ofp + FiOfq + C{Of<p 
B(0)^p + D(0) <7 

mfp > q 

If p,q satisfy (11.3), then (11.1) holds for every L such that 

q = LF,{0)^p. 



Proof. Theorem [10] shows that (fTTIl) holds if and 
only if A(0) - E(0)LF(0) is Schur and C[/ - A(0) - 
E(0)LF(0)]-iB(0) + D(0) < 7. According to Theorem g] 
this is true if and only if 



A(0) 



E(0)Lr(0) 
-iC(0) 



B(0) 

7-iD(0) 



(12) 



is Schur By Proposition |2] this is equivalent to existence of 
5 e W\_ such that 

^A(0) + E(0)LF(0) B(0) 
7-'C(0) 7-'D(0) 

This is equivalent to ([1112) if we set ^ 
desired equivalence between ([TTIl) and ([1112) in Theorem [TTI 
follows. The equivalence between ([TTIl) and ([TTl3) is obtained 
by replacing G{s) with its transpose. D 

Example 5. Formation of vehicles with inertia. In Exam- 
ple 3, the inputs and disturbances were supposed to have an 
immediate impact on the vehicle velocities, i.e. the inertia of 
the vehicles was neglected. Alternatively, a model that takes 
the inertia into account can be stated as follows: 



Theorem 12: A symmetric Metzler matrix with 2m non- 
zero off-diagonal entries is negative semi-definite if and only if 
it can be written as a sum of m negative semi-definite matrices 
where each term has only four non-zero entries. 

The proof of Theorem [12] will be based on the following 
minor modification of 113, Theorem 3.1]: 

Proposition 13 (Positive Quadratic Programming): 
Suppose Mq, . . . , Mk are Metzler and bi, . . . ,bK G K. Then 

max trace(MoX) 

X hO trace(MfcX) > bk 





1 




1 




max 


X MqX 




< 




xeRl 


x'^MkX > bk 
k^l,...,K 




- 


iF(0)C, so the 







1, 



.K 



(13) 



Xi 



3 



lij [Xj 



Xi 



-U, 



Wi 



1, 



,N 



where Ui is an external control force, Wi is a disturbance and 
iij is the spring constant between the vehicles i and j. Suppose 
that local control laws Ui — —kiXi~diX are given and consider 
the problem to find spring constants £ij £ [0,iij] that minimize 
the gain from wi to xi. 

The closed loop system has the following frequency domain 
description 



i^+djs + fc, +^% jA,(s) 



Similarly to Example 3, we write this on matrix form as 

X = {A + 'ELF)X + BW 

The transfer matrices B, E and A + ELF are positively 
dominated for all L E T> provided that di > h + J^j "^ij ■ 
Hence Theorem [TT] can then be appUed to find the optimal 
spring constants. Notice that iij and £ji must be optimized 
separately, even though by symmetry they must be equal at 
optimum. □ 

VIII. Scalable Verification of the Lyapunov 
Inequality 

In the preceding sections we have derived scalable con- 
ditions for verification of stability and optimality based on 
the linear inequalities in ([T]2) and ([T]3) of Proposition [T] 
The condition ([T]4) is also useful for applications, but it less 
obvious how to verify the Lyapunov inequality A^P+PA -< 
in a test that scales linearly with the number of non-zero 
entries of A. However, the following result can be used for 
this purpose: 



The value on the right hand side remains the same if the 
condition X ^^ is relaxed to X S X, where X is the set 
of symmetric matrices (xij) € R"^" satisfying xu > and 
a^f, < XiiXjj for all i,j. Moreover, if there exists a matrix X 
in the interior of X with trace(MfeX) > bk for every k, then 
the maximum of ST3[ is equal to the minimum of — J^k "^k^k 
over Ti, . . . , Ti<- > such that A/q + X^fe "^kMk di 0. 

Remark 9. The problem on the right of ([13) is always convex 
and readily solvable by semidefinite programming. The prob- 
lem on the left is generally not a convex program, since the 
matrices Mk may be indefinite. However, the maximization on 
the left is concave in {xf, . . . ,xfj 1 19|. This is because every 
product XiXj is the geometric mean of two such variables, 
hence concave |[6j p. 74]. 

Remark 10. The second statement of Proposition [13] is impor- 
tant for scalability, since the condition X £ X has a natural 
decomposition and only entries of X that correspond to non- 
zero entries of Mk need to be taken into account. 

Proof of Proposition [73] Every x satisfying the constraints on 
the left hand side of ST3[ corresponds to a matrix X = xx'^ 
satisfying the constraints on the right hand side. This shows 
that the right hand side of ( [T3l l is at least as big as the left. 

On the other hand, let X — (xij) be a positive definite 
matrix. In particular, the diagonal elements xn, . . . ,a;„„ are 
non-negative and Xij < y/xuXjj. Let x — (-y/xn, . . . , ^Jxnn)- 
Then the matrix xx"^ has the same diagonal elements as X, but 
has off-diagonal elements y/xuxjj instead of Xij . The fact that 
xx'^ has off-diagonal elements at least as big as those of X, 
together with the assumption that the matrices Mk are Metzler, 
gives x^ MkX > tra,ce{MkX) for k — I, . . . ,K. This shows 
that the left hand side of ([T3) is at least as big as the right. 
Nothing changes if X is not positive definite but X e X, so 
the second statement is also proved. 

For the last statement, note that the conditions 
trace(MfeX) > bk are linear in X, so strong duality 
holds pS Theorem 28.2] and the right hand side of ([TJt has 
a finite maximum if and only if Mq + J2k=i ^k^k di for 
some Ti, . . . ,tk > 0. □ 



Proof of Theorem [72] Let £ be the set of indices {k,l) of 
non-zero off-diagonal entries in M. Define 

X£ = {X £ M"^" : [ck eiYX[ek e/] > for all (/c, /) £ £] 




H 



Fig. 3. Illustration of the power transmission network described in Example 5 
and 6. Example 5 describes a purely resistive network under static conditions, 
while Example 6 treats a dynamic network with inductive transmission lines 
and capacitances at the nodes. 



where ei, . . . , e„ are the unit vectors in R". If M is negative 
semi-definite, then 



= max X Mx — max trace(MX) 

|a:|<l XeX£ 



(k,l)&£ 



mm max trace 



M+ ^ [eft ei]Nki[ek e;] 
, (k,i)e£ 



X 



where Nki € R for every k and /. In particular, 
there exists a choice of the matrices Nki that makes M + 
TliikDes^^k ei]Nki[ek ei]'^ = 0. This completes the proof. 

D 

To emphasize that Proposition [T3] is of independent appli- 
cation interest, we include an example from [TSl: 
Example 6. Optimal power flow in DC network. Consider 
a DC power transmission network as in Figure |3] All nodes 
are subject to constraints of the form ikVk < Pk- For gener- 
ating nodes pk represents the generator capacity. For power 
consuming loads ik and pk are negative and —pk represents 
the power demand. 

Every connection has a known admittance yjk = Vkj > 0. 
In particular, the current flowing from node 1 to node 2 equals 
2/12(^1 —V2)- Writing Kirchhoff's current law for all nodes in 
Figure |3] gives 

'ii = yuivi - V2) + 2/14(^1 - V4) 

i2 = 2/12(^2 - Vi) + 2/23(w2 - V3) + 2/24(^2 - V4) 

h = 2/23(^3 - V2) 

(14= 2/14(w4 - Vi) + 2/24(^4 - V2) 



(14) 



Suppose every link has a has a capacity bound c^ on the 
transferred power and every node has upper and lower bounds 



on the voltage according to u; 



111 in 

k 



< Vk < v" 



Then the 



problem to minimize the power losses in the network subject 
to constraints on power demands, voltage and link capacities 
can be written 

Minimize iivi + ■ • • + iNVN 
subject to (fT4l i with Vkik £ Pk 

for j,fc = l,...,Ar 

This is a quadratic optimization problem of the form handled 
by Proposition [T3] □ 



IX. The KYP Lemma for Positive Systems 

Input-output gain is certainly not the only way to quantify 
the performance of a linear time-invariant system. A more 
general class of specifications known as Integral Quadratic 
Constraints [20| can be tested using the Kalman-Yakubovich- 
Popov lemma. It is therefore of interest to see that the corre- 
sponding result of 1 28 1 for positive systems can be generalized 
the following way: 

Theorem 14: Let A G R"^" be Metzler and Hurwitz, while 
B e M"^™ and the pair {A,B) is controllable. Suppose that 
all entries of Q e R("+'") ^ {n+m) ^g nonnegative, except for 
the last m diagonal elements. Then the following statements 
are equivalent: 



— min max trace(MX) + >^ trace ( A^t; fei. eA X\eh eA] 



(14.1) For u) E [0,00] is is true that 



(14.2) 





\iujl - 


I 


^B 


* 

Q 


{iul ~ A)-^B 
I 


'-A-^B 
I 


* 

Q 


'-A-^B 
I 


^ 0. 


There exists a diagonal P )^ such that 




Q + 


A^P + PA PB 
B^P 


^0 



-<o 



(14.3) 



Moreover, if all inequalities are taken to be strict, then the 
equivalences hold even without the controllability assumption. 

Remark 11. For A = -I, B = Q, Q = [« J], condition dHl) 
holds, but not (fT4l3). This demonstrates that the controllability 
assumption is essential. 

Proof. One at a time, we will prove the implications 
([T4]l)=»([l4]2)^([Tl3)=^([lll). Putting u ^ Q immediately 
gives ([Tl2) from ([Til). 

Assume that (fT4l2) holds. The matrix —A^^ is nonnegative, 

so M ^g [,;^1 < for aU a; e M!^, w e W"_^ with 

X < -A-^Bw (15) 

The inequality (flST l follows (by multiplication with —A^^ 
from the left) from the constraint < Ax + Bw, which can 
also be written < AiX + BiW for i ~ 1, . . . ,n, where Ai 
and Bi denote the i:th rows of A and B respectively. For 
non-negative x and w, this is equivalent to 

< Xi{AiX + Biw) i = l,...,n (16) 



Hence (T^2) implies that [^1 Q 



< for x e 



w e 



ji^^ satisfying ( fTSI l. Proposition 1 1 31 will next be used to verify 
existence of ri , . . . , r„ > such that the quadratic form 

r -\T 



a{x,w) 



Q 



+ y TiXi{AiX + Biw) 



is negative semi-definite. However, the application of Propo- 
sition [13] requires existence of a positive definite X such that 
all diagonal elements of 



[A B]X 



are positive. The pair (A, B) is controllable, so there exists K 
that make all eigenvalues of A + BK unstable and therefore 
(A + BK)Z + ZiA + BK)^ = / has a symmetric positive 
definite solution Z. Hence the desired X can be constructed 
as 

" Z ZK^^ 
KZ * 



(15.1) For uj E [0,00] is is true that 



X = 



where the lower right corner is chosen big enough to make 
X^O. 

Define P = diag(ri, . . . , r„) >: 0. Then a being negative 
definite means that 

'A^P + PA PB 
B^P 

so ([Tl3) follows. 

Finally, assume that (fl4l3) holds. Integrating cr(a;(i), w(i)) 
over time gives 

T 



Q 



-<o 



o> 



Q 



+ x^PiAx + Bw) dt 



For square integrable solutions to i = Ax + Bw, x(0) — 
we get 



0> 



/ 


r -| 


T 








X 


Q 


X 


+ 


\ 


W 




w 




r tT r n 


xit) 


Q 


xit) 


wit) 


wit) 



+ —ix^Px/2) I dt 



dt 



which in frequency domain implies (fl4l l). Hence 
dH] I)=»([l4l2)=>([133)^([l4l 1). 

For strict inequahties, the proofs that (IT4l3)^(fT4ll)^(fT4l2) 
remain the same. Assuming that (fT4l2) holds with strict 
inequahty, we get 



-A-^B 
I 



iQ + (I) 



-A-^B 
I 



-<0 



for some scalar e > 0. Hence, there exists a diagonal P ^ 
such that 

'A^P + PA PB' 
B^P 



Q + el 



-<0 



Adding a small multiple of the identity to P gives P y such 
that 

'A'^P + PA PB' 
B^P 



Q 



^0 



so also (fl4l3) holds with strict inequality. Hence the proof is 
complete. D 

An analogous discrete time result is stated here and proved 
in the appendix: 

Theorem 15: Let A G KIj:''" be Schur, while B e K'j:'"" 
and the pair (A, B) is controllable. Suppose that all entries 
of Q e r("+™)x("+™) are nonnegative, except for the last 
m diagonal elements. Then the following statements are 
equivalent: 



'ie"^I-A)- 
I 


^B 


Q 


/ 


A)-^B' 


iI-A)-^B 
I 


Q 


\i- 


-A)-^B 
I 


^0. 



-<o 



(15.2) 



(15.3) There exists a diagonal P >{) such that 

\A^PA - P A^PB 
B^PA B^PB 



Q 



^0 



Moreover, if all inequalities are taken to be strict, then the 
equivalences hold even without the controllability assumption. 



X. Conclusions 

The results above demonstrate that the monotonicity prop- 
erties of positive systems and positively dominated systems 
bring remarkable benefits to control theory. Most important 
is the opportunity for scalable verification and synthesis of 
distributed control systems with optimal input-output perfor- 
mance. In particular, the linear programming solution comes 
with certificates that enable distributed and scalable verifica- 
tion of global optimality, without access to a global model 
anywhere. 

Many important problems remain open for future research. 
Here are two examples: 

• How can the scalable methods for verification and syn- 
thesis be extended to monotone nonlinear systems? 

> How can the optimization of static controller gains in this 
paper be extended to scalable methods for optimization 
of dynamic controllers? 
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XII. Appendix 

Proof of Theorem H] First note that the maximum 
maxtj |G(e*")| must be attained at a; = since 



|G(e-)| = 



D + e""^' ^ Ce-'-^'^A^B 



t=o 



<D + Y^ CA*B = D + C{I- A)-^B = G(l) 



t=o 



Hence ||G||oo < 7 may equivalently be written 

C{I-A)-^B + D<-/ 

Suppose that dUl) holds. Then C{I - A)-^B + Z? < 7. By 
Proposition |2] there exists ^ > such that A£, < ^. Define 



X 

Moreover 



^+ {I - A)-^B. Then a; > since (/ - A)-^ > 0. 



x~Ax = ^-A^ + B>B 
If ^ is sufficiently small, we also get Cx + D < 7 so 



A B 



< 



(17) 



and (|4l2) holds. Conversely, (|4]2) implies (fTTI i for some x > 0, 
so A is Schur and 

(I-A)-'^B<x C{I-A)-^B + D<Cx + D<j 

Hence (IDl) follows and equivalence between (|4]l) and ^2) 
has been established. 

Next assume that (|4]3) holds. For w = 0, the inequality 
implies |x(i+ l)|p < |a;(i)|p with strict inequality for x{t) 7^ 
0, so A must be Schur Summing the inequality in (|4]4) along 
solutions to x{t + 1) = Ax(t) + Bw{t) with a;(0) — gives 

T T 

J2 |Ca;(i) + Dw{t)fdt < 7^^ |w(t)|2di 



t=o 



t=o 



so ||G||oo < 7 and (Hjl) follows. For the opposite implication, 
assume that (012) holds. Then Proposition |2] proves existence 
of a diagonal P ;^ such that 



A B 



-\T r 



P 

72 



'A B 

7-1(7 ^-^D 



-< 



P 

72 



Multiplying with [x^ uF] from the left and its transpose from 
the right gives 

{Ax + Bw)'^P{Ax + Bw) + {Cx + Dwf < x'^ Px + j^w'^ 

with strict inequality for {x,w) 7^ (0,0). Condition (013) 
follows and the equivalence between (01l)<;=>(0l2)<=>(0l3) has 
been established. 

Assuming that (0l4) holds, consider x{t) > 0, ■w{t) > and 
rewrite the inequality as 

p^{Ax + Bw) + Cx + Dw < p^x + 7W 



Hence 



A B 

7-1(7 7-i£' 



-iT 


P/l 
1 


< 


P/l 
1 



(18) 



and (0l2) follows by Proposition |2l 

Finally, assuming (0l2), Proposition |2l gives ( fTsT i for some 
p > 0, so 

p^\x{t + 1)\ + \Cx{t) + Dw{t)\ 
< p^ {A\x{t)\ + B\w{t)\) + C\x{t)\ + D\w{t)\ 
<p'^\x{t)\+-f\w{t)\ 

with strict inequality when {x{t),w{t)) ^ (0,0). This proves 
(0l4) and the proof is complete. D 
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Proof of Theorem [75] The theorem can be proved in analogy 
with the proof of Theorem [14] Alternatively, it can be derived 
from Theorem [14] using a bilinear transformation in the 
following way: 

Instead of e*", one can parametrize the unit circle as i[^'" . 
Hence JTSll) is equivalent to saying that 



< 



for all solutions (w, x, w) to the equation 

1 + lUJ 



I - A\x = Bw 



1 — icu 
Alternatively, introducing 

A = {A-I){A + I)-^ 
B^2{A + I)-^B 

X ^ X + Ax + Bw 

„_r(A + /)-i -{A+i)-^B 

*^" [ / 

the condition can be re-written as the statement that 



Q 



<o 



for all solutions (cj, x, w) to the equation [iujl — A)x = Bw. 
According to Theorem [14] this is equivalent to existence of a 
diagonal P >Q such that 



A^P + PA PB 
B^P 



-<0 



Multiplying by 



A + I B 
/ 



from the right and its transpose 

from the left, the matrix inequality (after trivial manipulations) 
becomes 

'A^PA - P A^PB 
B^PA B^PB 



Q 



-<0 



Replacing 2P by P makes the proof complete. 



D 



